Poster: Cross Cloud MapReduce: an Uncheatable MapReduce
نویسندگان
چکیده
MapReduce [1] is becoming a popular data processing application on Cloud Environment. However, security issues make many customers reluctant to move their critical computation tasks to cloud. For instance, [2] points out a real security vulnerability that the cloud service leader Amazon EC2 suffers from: some members of EC2 can create and share Amazon Machine Image (AMI) to the EC2 community so that other users can deploy their server by simply loading an AMI. A malicious AMI, if widely used, could flood the community with hundreds of infected virtual instances. On the other hand, in MapReduce where jobs are carried out via the collaboration of a number of computing nodes, merely one malicious node may render the overall results useless. In a traditional MapReduce setting where each node is deployed on the cloud, the integrity of a computation can be easily compromised and difficult to detect. In this work, we propose a new MapReduce Framework, Cross Cloud MapReduce (CCMR), which can be deployed among a single private cloud and multiple public clouds. By using the replication, hold-and-test, verification and credit-based trust management approaches, CCMR can eliminate malicious compute nodes and guarantee high computation accuracy while incurring acceptable overhead.
منابع مشابه
Cloud Computing Technology Algorithms Capabilities in Managing and Processing Big Data in Business Organizations: MapReduce, Hadoop, Parallel Programming
The objective of this study is to verify the importance of the capabilities of cloud computing services in managing and analyzing big data in business organizations because the rapid development in the use of information technology in general and network technology in particular, has led to the trend of many organizations to make their applications available for use via electronic platforms hos...
متن کاملCross-cloud MapReduce for Big Data
MapReduce plays a critical role as a leading framework for big data analytics. In this paper, we consider a geodistributed cloud architecture that provides MapReduce services based on the big data collected from end users all over the world. Existing work handles MapReduce jobs by a traditional computation-centric approach that all input data distributed in multiple clouds are aggregated to a v...
متن کاملTitle : IEEE Transactions on Cloud Computing Title of Paper : Cross - cloud MapReduce for Big Data
MapReduce plays a critical role as a leading framework for big data analytics. In this paper, we consider a geodistributed cloud architecture that provides MapReduce services based on the big data collected from end users all over the world. Existing work handles MapReduce jobs by a traditional computation-centric approach that all input data distributed in multiple clouds are aggregated to a v...
متن کاملA MR Simulator in Facilitating Cloud Computing
MapReduce is an enabling technology in support of Cloud Computing. Hadoop which is a mapReduce implementation has been widely used in developing MapReduce applications. This paper presents Hadoop simulatorHaSim, MapReduce simulator which builds on top of Hadoop. HaSim models large number of parameters that can affect the behaviors of MapReduce nodes, and thus it can be used to tune the performa...
متن کاملA Model-driven Approach for Price/Performance Tradeoffs in Cloud-based MapReduce Application Deployment
This paper describes preliminary work in developing a modeldriven approach to conducting price/performance tradeo s for Cloudbased MapReduce application deployment. The need for this work stems from the signi cant variability in both the MapReduce application characteristics and price/performance characteristics of the underlying cloud platform. Our approach involves a model-based machine learn...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012